NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

ReasonBERT: Pre-trained to Reason with Distant Supervision

https://doi.org/10.18653/v1/2021.emnlp-main.494

Deng, Xiang; Su, Yu; Lees, Alyssa; Wu, You; Yu, Cong; Sun, Huan (November 2021, Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing)

We present ReasonBert, a pre-training method that augments language models with the ability to reason over long-range relations and multiple, possibly hybrid contexts. Unlike existing pre-training methods that only harvest learning signals from local contexts of naturally occurring texts, we propose a generalized notion of distant supervision to automatically connect multiple pieces of text and tables to create pre-training examples that require long-range reasoning. Different types of reasoning are simulated, including intersecting multiple pieces of evidence, bridging from one piece of evidence to another, and detecting unanswerable cases. We conduct a comprehensive evaluation on a variety of extractive question answering datasets ranging from single-hop to multi-hop and from text-only to table-only to hybrid that require various reasoning capabilities and show that ReasonBert achieves remarkable improvement over an array of strong baselines. Few-shot experiments further demonstrate that our pre-training method substantially improves sample efficiency.
more » « less
Full Text Available
Perturbation-based Detection and Resolution of Cherry-picking

Asudeh, Abolfazl; Wu, You; Yu, Cong; Jagadish, H. V. (September 2021, A Quarterly bulletin of the Computer Society of the IEEE Technical Committee on Data Engineering)
Wang, Haixun; Li, Chengkai; Yang, Jun (Ed.)
In settings where an outcome, a decision, or a statement is made based on a single option among alternatives, it is popular to cherry-pick the data to generate an outcome that is supported by the cherry-picked data but not in general. In this paper, we use perturbation as a technique to design a support measure to detect, and resolve, cherry-picking across different contexts. In particular, to demonstrate the general scope of our proposal, we study cherry picking in two very different domains: (a) political statements based on trend-lines and (b) linear rankings. We also discuss sampling-based estimation as an effective and efficient approximation approach for detecting and resolving cherry-picking at scale.
more » « less
Full Text Available
Perturbation-based Detection and Resolution of Cherry-picking

Asudeh, Abolfazl; Wu, Yu; Yu, Cong; Jagadish, H.V. (January 2021, A Quarterly bulletin of the Computer Society of the IEEE Technical Committee on Data Engineering)

Full Text Available
TURL: Table Understanding through Representation Learning

https://doi.org/10.14778/3430915.3430921

Deng, Xiang; Sun, Huan; Lees, Alyssa; Wu, You; Yu, Cong (January 2021, Proceedings of the VLDB Endowment)
null (Ed.)
Full Text Available
On detecting cherry-picked trendlines

https://doi.org/10.14778/3380750.3380762

Asudeh, Abolfazl; Jagadish, H. V.; Wu, You; Yu, Cong (February 2020, Proceedings of the VLDB Endowment)

Full Text Available
Introduction to the Special Issue on Combating Digital Misinformation and Disinformation

https://doi.org/10.1145/3321484

Hassan, Naeemul; Li, Chengkai; Yang, Jun; Yu, Cong (July 2019, Journal of Data and Information Quality)

Full Text Available
Automated Pop-Up Fact-Checking: Challenges & Progress

Adair, Bill; Li, Chengkai; Yang, Jun; Yu, Cong. (January 2019, Proceedings of the Computation + Journalism Symposium)

Full Text Available
Query Perturbation Analysis: An Adventure of Database Researchers in Fact-Checking

Yang, Jun; Agarwal, Pankaj K; Roy, Sudeepa; Walenz, Brett; Wu, You; Yu, Cong; Li, Chengkai. (September 2018, A Quarterly bulletin of the Computer Society of the IEEE Technical Committee on Data Engineering)

Full Text Available
Investigating Rumor News Using Agreement-Aware Search

https://doi.org/10.1145/3269206.3272020

Shang, Jingbo; Shen, Jiaming; Sun, Tianhang; Liu, Xingbang; Gruenheid, Anja; Korn, Flip; Lelkes, Adam D.; Yu, Cong; Han, Jiawei (October 2018, Proceedings of the 27th {ACM} International Conference on Information and Knowledge Management, {CIKM} 2018)

Full Text Available

Search for: All records